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BEYOND HIRSCH CONJECTURE: WALKS ON RANDOM POLYTOPES 
AND SMOOTHED COMPLEXITY OF THE SIMPLEX METHOD 

'ob" ■ ROMAN VERSHYNIN 

o: 
o ■ 

^s^ ■ Abstract. The smoothed analysis of algorithms is concerned with the expected running 

» I I time of an algorithm under slight random perturbations of arbitrary inputs. Spielman and 

C^' Teng proved that the shadow-vertex simplex method has polynomial smoothed complexity. 
On a slight random perturbation of an arbitrary linear program, the simplex method finds 

t*^ ' the solution after a walk on polytope(s) with expected length polynomial in the number 

C^ ■ of constraints n, the number of variables d and the inverse standard deviation of the 

i__i, perturbation l/cr. 

^0 ' We show that the length of walk in the simplex method is actually polylogarithmic in 

^J ■ the number of constraints n. Spielman- Teng's bound on the walk was ©'(n^^d^^o-""^"), up 

C/3 , to logarithmic factors. We improve this to 0(log^n(d® + d^a^*)). This shows that the 

I ^1 ' tight Hirsch conjecture n — d on the length of walk on polytopes is not a limitation for 

■ the smoothed Linear Programming. Random perturbations create short paths between 

\1 ' ' vertices. 

ji-v ' We propose a randomized phase-I for solving arbitrary linear programs, which is of 

l/^ . independent interest. Instead of finding a vertex of a feasible set, we add a vertex at 

>— P I random to the feasible set. This does not affect the solution of the linear program with 

^—>. • constant probability. So, in expectation it takes a constant number of independent trials 

^^^ , until a correct solution is found. This overcomes one of the major difficulties of smoothed 

^^ ' analysis of the simplex method - one can now statistically decouple the walk from the 

^ • smoothed linear program. This yields a much better reduction of the smoothed complexity 

. • , to a geometric quantity - the size of planar sections of random polytopes. We also improve 

, J^ ' upon the known estimates for that size, showing that it is polylogarithmic in the number 

j^ ' of vertices. 
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1. Introduction 

The simplex method is "the classic example of an algorithm that is known to perform 
well in practice but which takes exponential time in the worst case" [6]. In an attempt to 
explain this behavior, Spielman and Teng [6j introduced the concept of smoothed analysis 
of algorithms, in which one measures the expected complexity of an algorithm under slight 
random perturbations of arbitrary inputs. They proved that a variant of the simplex method 
has polynomial smoothed complexity. 

Consider a linear program of the form 

maximize (z,x) 

(LP) 
subject to Ax < b, 

where A is an n x d matrix, representing n constraints, and a; is a vector representing d 
variables. 

A simplex method starts at some vertex xq of the polytope Ax < b, found by a phase-I 
method, and then walks on the vertices of the polytope toward the solution of (LP). A pivot 
rule dictates how to choose the next vertex in this walk. The complexity of the simplex 
method is then determined by the length of the walk - the number of pivot steps. 

So far, smoothed analysis has only been done for the shadow-vertex pivot rule introduced 
by Gaas and Saaty [7j. The shadow- vertex simplex method first chooses an initial objective 
function zq optimized by the initial vertex xq. Then it interpolates between zq and the 
actual objective function z. Namely, it rotates zq toward z and computes the vertices that 
optimize all the objective functions between zq and z. 

A smoothed linear program is a linear program of the form (LP), where the rows a, of 
A, called the constraint vectors, and b are independent Gaussian random vectors, with 
arbitrary centers Oj and b respectively, and whose coordinates are independent normal 
random variables with standard deviations crmaxj ||(aj,6j)||. Spielman and Teng proved 

Theorem 1.1. ^J For arbitrary linear program with d > 3 variables and n > d constraints, 
the expected number of pivot steps in a two-phase shadow-vertex simplex method for the 
smoothed program is at most a polynomial V{n,d,a^^). 



Spielman-Teng's analysis yields the following estimate on expected number of pivot steps: 

where the logarithmic factors are disregarded. The subsequent work of Deshpande and 
Spielman [5] improved on the exponents of d and a; however, it doubled the exponent of n. 

Another model of randomness, in which the directions of the inequalities in (LP) are 
chosen independently at random, was studied in the eighties [U [U [ISj El [2] . It was shown 
that a shadow- vertex simplex method solves such problems in 0{d'^) expected number of 
pivot steps. Note that this bound does not depend on the number of constraints n. However, 
it is not clear whether reversion of the inequalities can be justified as a good model for typical 
linear problems. 

These results lead to the question - how does the smoothed complexity of the simplex 
method depend on the number of constraints n? Unlike in the model with randomly reversed 
inequalities, the number of pivots steps in the smoothed analysis must be at least logarithmic 
in n (see below). 

In this paper, we prove the first polylogarithmic upper bound: 

Theorem 1.2 (Main). The expected number of pivot steps in Theorem \l.l\ is at most 

V{n,d,a-^) < 0{log^n{d^ + d^(j-'^)). 
See Theorem 16.11 for a more precise estimate. 

So, the number of pivot steps is polylogarithmic in the number of constraints n, while the 
previous known bounds were polynomial in n. 

This bound goes in some sense beyond the classical Hirsch conjecture on the diameter of 
polytopes, the maximal number of steps in the shortest walk between any pair of vertices 
(see e.g. [IQ]). Hirsch conjecture states that the diameter of a polytope with n faces in d 
dimensions, and in particular of the feasible polytope of (LP), is at most n — d. 

Hirsch conjecture is tight, so it would be natural to think of n — d as a lower bound on the 
worst case complexity of any variant of the simplex method. Theorem ll.2l (and Theorem l6.2l 
below) claim that a random perturbation creates a much shorter path between a given pair 
of vertices. Moreover, while Hirsch conjecture does not suggest any algorithm for finding a 
short walk, the shadow-vertex simplex method already finds a much shorter walk! 

One can wonder whether a random perturbation creates a short path between the vertices 
by destroying most of them. This is not so even in the average case, when ^ is a matrix with 
i.i.d. Gaussian entries. Indeed, the expected number of vertices of the random polytope is 
asymptotic to 2^d~^/'^{d — l)~^(7rlogn)'^'^~^)/^ ([12], see [9]). This bound is exponential in 
d and sublinear but not polylogarithmic in n, as Theorem 11.21 claims. 
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For d = 2, the random polygon will have il,(^/\ogn) vertices, which is thus a lower bound 
on the number of pivot steps of (any) simplex algorithm. 

The smoothed complexity (expected running time) of the simplex method is 0{V{n, d,a^^) tpivot), 
where tpivot is the time to make one pivot step under the shadow-vertex pivot rule. The 
dependence of tpivot on n is at most linear, for one only needs to find an appropriate vector 
ai among the n vectors to update the running vertex. However, for many well structured 
linear problems the exhaustive search over all Oj is not necessary, which makes tpivot niuch 
smaller. In these cases. Theorem 11.21 shows that the shadow-vertex simplex method can solve 
very large scale prohlem,s (up to exponentially many constraints). 

Acknowledgements. The author is grateful to the referees for careful reading of the man- 
uscript and many useful remarks and suggestions, which greatly improved the paper. 

2. Outline of the approach 

Our smoothed analysis of the simplex method is largely inspired by that of Spielman 
and Teng [B]. We resolve a few conceptual difficulties that arise in fB], which eventually 
simplifies and improves the overall picture. 

2.1. Interpolation: reduction to unit linear programs. First, we reduce an arbitrary 
linear program (LP) to a unit linear program - the one for which 6=1. This is done by a 
simple interpolation. This observation is independent of any particular algorithm to solve 
linear programs. 

An interpolation variable will be introduced, and (LP) will reduce to a unit program in 
dimension d+\ with constraint vectors of type (oj, 6j). A simple but very useful consequence 
is that this reduction preserves the Gaussian distribution of the constraints - if (LP) has 
independent Gaussian constraints (as the smoothed program does), then so does the reduced 
unit program. 

2.2. Duality: reduction to planar sections of random polytopes. Now that we have 
a unit linear program, it is best viewed in the polar perspective. The polar of the feasible 
set Ax < 1 is the polytope 

P = conv(0, ai, . . . ,««)• 

The unit linear problem is then equivalent to finding facet (z), the facet of P pierced by the 
ray {tz : t > 0}. In the shadow- vertex simplex method, we assume that phase-I provides 
us with an initial objective vector zq and the initial facet (zq). Then phase-II of the simplex 
method computes facet(g) for all vectors q in the plane E = span(2;o, z) between zq and z. 
Specifically, it rotates q from zq toward z and updates facet (g') by removing and adding one 
vertex to its basis, as it becomes necessary. At the end, it outputs facet(z). 



The number of pivot steps in the simplex method is bounded by the number of facets 
of P the plane E intersects. This is the size of the planar section of the random polytope 
P, the number of the edges of the polygon P D E. Under a hypothetical assumption that 
E is fixed or is statistically independent of P, the average size of P n E' is polynomial in 
the dimension d and the reciprocal of the standard deviation a and polylogarithmic in the 
number of vertices n, see Theorem 12. II below. 

The main complication of the analysis in [6] was that plane E = span(2;o,2:) was also 
random, and moreover correlated with the random polytope P. It is not clear how to find 
the initial vector zq independent of the polytope P and, at the same time, in such a way 
that we know the facet of P it pierces. Thus the main problem rests in phase-I. None of the 
previously available phase-I methods in linear programming seem to effectively overcome 
this difficulty. 

The randomized phase-I proposed in [6] exposed a random facet of P by multiplying a 
random d-subset of the vectors Oi by an appropriately big constant to ensure that these 
vectors do span a facet. Then a random convex linear combination of these vectors formed 
the initial vector zq. This approach brings about two complications: 

(a) the vertices of the new random polytope are no longer Gaussian; 

(b) the initial objective vector zq (thus also the plane E) is correlated with the random 
polytope. 

Our new approach will overcome both these difficulties. 



2.3. Phase-I for arbitrary linear programs. We propose the following randomized 
phase-I for arbitrary unit linear programs. It is of independent interest, regardless of its 
applications to smoothed analysis and to the simplex method. 

Instead of finding or exposing a facet of P, we add a facet to P in a random direction. We 
need to ensure that this facet falls into the numb set of the linear program, which consists 
of the points that do not change the solution when added to the set of constraint vectors 
(oj). Since the solution of the linear program is facet(z), the affine half-space below the 
affine span of facet(z) (on the same side as the origin) is contained in the numb set. Thus 
the numb set always contains a half-space. 

A random vector zq drawn from the uniform distribution on the sphere S is then 
in the numb half-space with probability at least 1/2. Moreover, a standard concentration 
of measure argument shows that such a random point is at distance Q,{d~^''^) from the 
boundary of the numb half-space, with constant probability. (This distance is the observable 
diameter of the sphere, see |llj Section 1.4). Thus a small regular simplex with center zq 
is also in the numb set with constant probability. Similarly, one can smooth the vertices of 
the simplex (make them Gaussian) without leaving the numb set. Finally, to ensure that 
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such simplex will form a facet of the new polytope, it suffices to dilate it by the factor 
M = maxj=i,...,„ ||ai||. 

Summarizing, we can add d linear constraints to any linear program at random, without 
changing its solution with constant probability. 

Note that it is easy to check whether the solution is correct, i.e. that the added constraints 
do not affect the solution. The latter happens if and only if none of the added constraints 
turn into equalities on the solution x. Therefore, one can repeatedly solve the linear program 
with different sets of added constraints generated independently, until the solution is correct. 
Because of a constant probability of success at every step, this phase-I terminates after an 
expected constant number of steps, and it always produces a correct initial solution. 

When applied for the smoothed analysis of the simplex method, this phase-I resolves one 
of the main difficulties of the approach in [6j. The initial objective vector zq and thus the 
plane E become independent of the random polytope P. Thus the smoothed complexity of 
the simplex method gets bounded by the number of edges of a planar section of a random 
polytope P, whose vertices have standard deviation of order 0,*{in.m{a,d~^''^)), see (|5.ip . In 
the previous approach [6] , such reduction was made with the standard deviation of order 

A deterministic phase-I is also possible, along the same lines. We have used that a 
random point in S is at distance Q{d~^''^) from a half-space. The same property is 
clearly satisfied by at least one element of the set {±ei, . . . , ie^}, where {ei, . . . , e^} is the 
canonical basis of W^. Therefore, at least one of d regular simplices of radius 2 d^^ " centered 
at points Cj, lies in the numb half-space. One can try them all for added constraints; at least 
one will give a correct solution. This however will increase the running time by a factor of d 
- the number of trials in this deterministic phase-I may be as large as d, while the expected 
number of trials in the randomized phase-I is constant. The smoothed analysis with such 
phase-I will also become more difficult due to having d non-random vertices. 

2.4. Sections of randomly perturbed polytopes. To complete the smoothed analysis 
of the simplex algorithm, it remains to bound the size (i.e. the number of edges) of the 
section P n -E of a randomly perturbed polytope P with a fixed plane E. 

Spielman and Teng proved the ffist polynomial bound on this size. We improve it to a 
polylogarithmic bound in the number of vertices n: 

Theorem 2.1 (Sections of random polytopes). Let ai,...,a„ be independent Gaussian 
vectors in M.'^ with centers of norm at most 1, and with standard deviation a. Let E be a 
plane in M . Then the random polytope P = conv(ai, . . . , a„) satisfies 

E|edges(Pn^)| = ©(dMog^n + dV"^). 



See Theorem 16.21 for a slightly more precise result. 

Spielman and Teng obtained a weaker estimate 0{nd^a^^) for this size ([6] Theorem 4.0.1). 
A loss of the factor of n in their argument occurs in estimating the angle of incidence ([6] 
Lemma 4.2.1), the angle at which a fixed ray in E emitted from the origin meets the facet 
of P it pierces. 

Instead of estimating the angle of incidence from one viewpoint determined by the origin 
0, we will view the polytope P from three different points Oi, O2, O3 on E. Rays will be 
emitted from each of these points, and from at least one of them the angle of incidence will 
be good (more precisely, the angle to the edge of P n £^, which is the intersection of the 
corresponding facet with E). 



3. Preliminaries 

3.1. Notation. The direction of a vector x in a vector space is the ray {tx : t > 0}. The 
non- negative cone of a set K, denoted cone{K), is the union of the directions of all the 
vectors x in K. The closed convex hull of K is denoted by conv(i^) and A(K). (We prefer 
the latter notation when K consists of d points in W^). 

The standard inner product in W^ is denoted by (x, y), and the standard Euclidean norm 
in M is denoted by ||x||. The unit Euclidean sphere in R is denoted by S . 

The polar of a set K in M"' is defined as K° = {x ^R"^ : {x, y) < 1 Vy G i^}. 

A half-space in M*^ is a set of the form {x : {z, x) < 0} for some vector z. An affine 
half-space takes the form {x : {z, x) < a} for some vector z and a number a. The definitions 
of a hyperplane and affine hyperplane are similar, with equalities in place of the inequalities. 
The normal to an affine hyperplane H that is not a hyperplane is the vector h such that 
H = {x : {h, x) = 1}. A point x is said to be below H if (/i, x) < 1. We say that a direction 
z pierces the hyperplane H (or a subset Hq thereof) if the ray {tz : t > 0} intersects H 
(respectively, Hq). 

The probability will be denoted by P, and the expectation by E. The conditional proba- 
bility on a measurable subset S of a probability space is denoted by P{-|S} and defined as 
F{A\B} = F{A n B}/F{B}. 

A Gaussian random vector g = {gi, . . . , g^) in M with center g = {gi, . . . , gd) and variance 
(T is a vector whose coordinates gi are independent Gaussian random variables with centers 
gi and variance a. 

Throughout the paper, we will assume that the vectors (oj, bi) that define the linear pro- 
gram (LP) are in general position. This assumption simplifies our analysis and it holds with 
probability 1 for a smoothed program. One can remove this assumption with appropriate 
modifications of the results. 
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A solution X of (LP) is determined by a d-set / of the indices of the constraints {ai,x) < hi 
that turn into equations on x. One can compute x from / by solving these equations. So 
we sometimes call the index set / a solution of (LP). 

For a polytope P = conv(0, ai, . . . ,a„) and a vector z in W^, facet(2:) = facetp(z) will 
denote the set of all faces of P which the direction z pierces. The point where it pierces 
them (which is unique) is denoted by zp. More precisely, facet (z) is the family of all d-sets 
/ such that A(aj)ig/ is a facet of the polytope P and the direction z pierces it. 

If z is in general position, facet (z) is an empty set or contains exactly one set /. The 
corresponding geometric facet A(aj)jg/ will be denoted by Facetp(2;) or Facetp(/). 

Positive absolute constants will be denoted by C, Ci,c, ci, . . .. The natural logarithms 
will be denoted by log. 

3.2. Vertices at infinity. For convenience in describing the interpolation method, we will 
assume that one of the constraint vectors Oj can be at infinity, in a specified direction 
u G M*^. The definitions of the positive cone and the convex hull are then modified in 
a straightforward way. If, say, aj is such an infinite vector and j € /, then one defines 
^iO'i)iei = ^{(^i)iei-{j} + {^^ • ^ — 0}i where the addition is the Minkowski sum of two 
sets, A + B = {a + b: ae A, be B}. 

Although having infinite vectors is convenient in theory, all computations can be per- 
formed with numbers bounded by the magnitude of the input (e.g., checking I E facet (z) 
for given z and / has the same complexity whether or not some vertex of P is at infinity). 

3.3. Polar shado'w vertex simplex method. This is the only variant of the simplex 
method whose smoothed complexity has been analyzed. We shall describe this method 
now; for more information see [.6j Section 3.2. 

The polar shadow vertex simplex method works for unit linear programs, i.e. programs 
(LP) with 6 = 1. A solution of such program is a member of facet (z) of the polytope 
P = conv(0, ai, . . . , a„). The program is unbounded iff facet(2;) = 0. (See [6] Section 3.2). 

The input of the polar shadow vertex simplex method is: 

(1) objective vector z; 

(2) initial objective vector zq; 

(3) facet(zo)) provided that it consists of only one set of indices. 

The simplex method rotates zq toward z and computes facet (g) for all vectors q between 
Zq and z. At the end, it outputs the limit of facet ((7) as q approaches z. This is the last 
running facet(g) before q reaches z. 

If facet(zo) contains more than one index set, one can use the limit of facet(g) as q 
approaches zq as the input of the simplex method. This will be the first running facet (g) 
when q departs from zq. 



If z and zq are linearly dependent, zq = —cz for some c > 0, one can specify an arbitrary 
direction of rotation u £ M", which is linearly independent of z, so that the simplex method 
rotates q in span{z,u) in the direction of u, i.e. one can always write q = ciz + C2U with 
C2 >0. 

4. Reduction to unit programs: Interpolation 
We will show how to reduce an arbitrary linear program (LP) to a unit linear program 

maximize {z, x) 

(Unit LP) 
subject to Ax < 1. 

This reduction, whose idea originates from [6j, is quite general and is independent from any 
particular method to solve linear programs. 

The idea is to interpolate between (Unit LP) and (LP). To this end, we introduce an 
additional (interpolation) variable t and a multiplier A, and consider the interpolated linear 
program with variables x, t: 

maximize {z, x) + At 

(Int LP) 
subject to ^x < t6+ (1 -t)l, < t < 1. 

The interpolated linear program becomes (Unit LP) for t = and (LP) for t = 1. We can 
give bias to t = by choosing the multiplier A ^ — cxd and to t = 1 by choosing A -^ +oo. 
Furthermore, (Int LP) can be written as a unit linear program in M'^"*"^: 

maximize {{z , X) , (x , t)) 

^. ,, ({{a,,l-b,),{x,t))<l, (Int LP') 

subject to < 

[((0,1), (x,t)) <1, ((0,-oo),(x,t)) <L 

The constraint vectors are (oj,! — 6j), (0,1) and (0,— oo). (see Section [3^ about vertices 
at infinity). This has a very useful consequence: if the constraints of the original (LP) are 
Gaussian, then so are the constraints of (Int LP'), except the two last ones. In other words, 
the reduction to a unit program preserves the Gaussian distribution of the constraints. 
The properties of interpolation are summarized in the following elementary fact. 

Proposition 4.1 (Interpolation). 

(i) (LP) is unbounded iff (Unit LP) is unbounded iff (Int LP) is unbounded for all 
sufficiently big A iff (Int LP) is unbounded for some A. 

(a) Assume (LP) is not unbounded. Then the solution of (Unit LP) equals the solution 
of (Int LP) for all sufficiently small A; in this solution, t = 0. 

(Hi) Assume (LP) is not unbounded. Then (LP) is feasible iff t = 1 in the solution of 
(Int LP) for all sufficiently big A. 
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(iv) Assume (LP) is feasible and bounded. Then the solution of (LP) equals the solution 
of (Int LP) for all sufficiently big X. 

Proof. See Appendix El ■ 

Now assuming that we know how to solve unit linear programs, we will be able to solve 
arbitrary linear programs. The correctness of this two-phase algorithm follows immediately 
from Proposition 14. 1[ 

Solver for (LP) 



Phase-I: Solve (Unit LP) using Solver for (Unit LP) of Section [5l If this 
program is unbounded, then (LP) is also unbounded. Otherwise, the solution 
of (Unit LP) and t = is a limit solution of (Int LP) as A — > — oo. Use this 
solution as the input for the next step. 

Phase-II: Use the polar shadow-vertex simplex method to find a limit solution 
of (Int LP) with A — > -|-oo. If t 7^ 1 in this solution, then the (LP) is infeasible. 
Otherwise, this is a correct solution of (LP). 



While this algorithm is stated in terms of limit solutions, one does not need to take actual 
limits when computing them. This follows from the properties of the polar shadow-vertex 
simplex method described in Section [3.31 Indeed, in phase-II of Solver for (LP) we can 
write (Int LP) as (Int LP') and use the initial objective vector zq = (0, —1), the actual 
objective vector z = (0,1), and the direction of rotation u = {z,0). Phase-I provides us 
with a limit solution for the objective vectors {ez, —l) = ZQ + euase^O~^. These vectors 
approach zq as we rotate from z toward zq in span(2;, u). Similarly, we are looking for a limit 
solution for the objective vectors {ez, 1) = z + eu as e ^^ 0^. These vectors approach z as 
we rotate from zq toward z in span(z, n). By Section [3.31 the polar shadow- vertex simplex 
method applied with vectors zq, z, u and the initial limit solution found in Phase-I, finds 
the correct limit solution in Phase-II. 

5. Solving unit programs: Adding constraints in Phase-I 

We describe a randomized phase-I for solving arbitrary unit linear problems of type 
(Unit LP). Rather than finding an initial feasible vertex, we shall add a random vertex to 
the feasible set. We thus add d constraints to (Unit LP), forming 

maximize {z,x) 

(Unit LP+) 
subject to A'^x < 1, 

where A'^ has the rows ai, . . . , a^, On+i, • • • , On+d with some new constraint vectors On+i, . . . , Un+d- 
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The first big question is whether the problems (Unit LP) and (Unit LP^) are equivalent, 
i.e. whether (Unit LP^) is bounded if and only if (Unit LP) is bounded, and if they are 
bounded, the solution of (Unit LP+) equals the solution of (Unit LP). This motivates: 

Definition 5.1. A numb set of a unit linear program is the set of all vectors a so that 
adding the constraint {a, x) < 1 to the set of the constraints produces an equivalent linear 
program. 

We make two crucial observations - that the numb set is always big, and that one can 
always check if the problems (Unit LP) and (Unit LP^) are equivalent. As mentioned in 
Section 13.11 we will assume that the constraint vectors Oj are in general position. 

Proposition 5.2. The numb set of a unit linear program contains a half-space (called a 
numb half-space j. 

Proof. Given a convex set K containing the origin in a vector space, Minkowski functional 
\\z\\k is defined for vectors z as \\z\\k = inf{A > : jz ^ K} if the infimum exists, 
and infinity if it does not exist. Then the duality shows that the solution maxAx<i{z,x) 
of (Unit LP) equals ||2;||p. (It is infinity iff the problem is unbounded; we will use the 
convention 1/co = in the sequel). By Hahn-Banach (Separation) Theorem, there exists a 
vector z* such that 

{z* ,x) <{z*,j^z) ■.= h for all x e P. 
G P implies that h >0. We define the affine half-space 

H- ={x: {z*,x) <h} 

and claim that H~ lies in the numb set of (Unit LP). To prove this, let a G H~ . Since 
P C H^ , we have conv(P U a) C H^ , thus 

\\z\\p > ||2:||conv{PUa) > \\z\\h- = \\z\\p 

where the first two inequalities follow from the inclusion P C conv(PU a) C H^ , and the 
last equality follows from the definition oi H^ . So, we have shown that ||2:||conv{Pua) = Ikll-P) 
which says that a and thus the affine half-space H~ is in the numb set of (Unit LP). Since 
h > 0, H^ contains the origin, thus contains a half-space. ■ 

In particular, if (Unit LP) is bounded, then its numb set is the affine half-space below 
facet (z). Then a similar duality argument proves: 

Proposition 5.3 (Equivalence). 

(i) If the added constraint vectors a„+i, . . . , an+d He in some numb half-space of (Unit LP), 
then (Unit LP^) is equivalent to (Unit LP). 
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(ii) (Unit LP^ ) is equivalent to (Unit LP) if and only if either (Unit LP^ ) is unbounded 
or its solution does not satisfy any of the added constraints {ai,x) < 1, i = n + 1, . . . ,n + d. 



Proposition 15.21 implies that a constraint vector zq whose direction is chosen at random 
in the unit sphere S , is in the numb set with probabihty at least 1/2. By a standard 
concentration of measure argument, a similar statement will be true about a small simplex 
centered at zq. It is then natural to take the vertices of this simplex as added constraint 
vectors an+i, ■ ■ ■ , On+d for (Unit LP+). To this end, we define the size i of the simplex and 
the standard deviation ai for smoothing its vertices as 

ai=minf-^===-, — ^i-— ), (5.1) 



\/log d ' V6\/dlogn' d^/^logd 

where ci = ^qq and C2 = j^- Then we form (Unit LP"*") as follows: 

Adding Constraints 



Input: Size Mq > and rotation U £ 0{d). 

Output: "Failure" or vectors fln+i, • • • i On+d and zq S cone(a„+i, . . . , a„+d). 

(1) Form a regular simplex: let z'q be a fixed unit vector in M^ and a'^^i, . . . , a'„_|_^ 
be the vertices of a fixed regular simplex in W^ with center and normal z'^, and 
radius \\zq — a[\\ = L 

(2) Rotate and dilate: let zq = 2MqUzq, Oj = 2MoC/a^ foi i = n + 1, . . . , n + d. 

(3) Smooth: let Oj be independent Gaussian random variables with mean en and 
standard deviation 2Mocri, for i = n + 1, . . . , n + d. 

(4) Check if the constraints added correctly: check if 

(a) Zq £ cone(a„+i, . . . , a„+rf) and 

(b) distance from to aff (a^+i, . . . , Un+d) is at least M. 
If not, return "Failure". 



Remark. Steps (3) and (4) are, strictly speaking, not necessary. They facilitate the 
theoretical smoothed analysis of the simplex method. However, they can be skipped in 
practical implementations. 

The crucial property of Adding Constraints is the following. (Recall that we regard 
a solution of a linear program as the index set of the inequalities that become equalities on 
the solution point, see Section [3TT]) . 



Theorem 5.4. Let (Unit LP) be a unit linear program with a numb half-space H , and let 
Mq > M where M = maxj=i ... „ \\<ii\\- Then: 
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1. Let U G 0{d) be arbitrary. If the algorithm Adding Constraints does not return 
"Failure", then a solution of (Unit LP^ ) with the objective function {zo,x) is the index set 
{n + l,. . . ,n + d}. 

2. With probability at least 1/4 in the choice of a random rotation U G 0(d) and random 
vectors a„+i, . . . jfln+d; the algorithm Adding Constraints does not return "Failure" and 
the vectors On+i, • • • ,an+d Hg in the numb half-space H. 

Proof. See Appendix |Bl ■ 

By Proposition 15.31 the conclusion of Theorem 15.41 is that: 

(a) with constant probability the problems (Unit LP+) and (Unit LP) are equivalent; 

(b) we can check whether they are equivalent or not (by part (ii) of Proposition 15. 3p : 

(c) we always know a solution of (Unit LP^) for some objective function (if "Failure" is 
not returned). 

Thus we can solve (Unit LP) by repeatedly solving (Unit LP"*") with independently added 
constraints until no "Failure" is returned and until the solution is correct. This forms a 
two-phase solver for unit linear programs. 

Solver for (Unit LP) 



Do the following until no "Failure" is returned and the solution I^ contains none of 
the indices n + 1, . . . , n + d: 

Phase-I: Apply Adding Constraints with Mq = er'°s^^T where M = 
maxj=i^,..^„ ||aj|| and with the rotation U chosen randomly and independently 
in the orthogonal group 0{d) according to the Haar measure. If no "Failure" 
returned, then {n + l, ... ,n + d} is a solution of (Unit LP"*") with the objective 
function {zq,x). Use this solution as the input for the next step. 

Phase-II: Use the polar shadow-vertex simplex method to find a solution /+ of 
(Unit LP+) with the actual objective function {z,x). 
Return I"*". 



Remark. The discretized maximum Mq is introduced in this algorithm only to simplify its 
smoothed analysis. In practical implementations of the algorithm, one can use Mq = M. 

6. Bounding the gomplexity via segtions of random polytopes 

We do here the smoothed analysis of Solver for (LP), and prove the following more 
precise version of Theorem 11.21 

Theorem 6.1. For an arbitrary linear program with d > 3 variables and n > d constraints, 
the expected number of pivot steps in a two-phase shadow-vertex simplex method for the 
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smoothed program is 

0(log2 n ■ log log n • (dV"^ + d^ log^n + d^ log^ d)) = 0*{d^ + dV"^). 

Remark. A more careful analysis may allow one to remove the factor log n ■ log log n. To 
do this, one uses the version of the algorithm with Mq = M and bounds the sections of a 
polytope with an added facet. This makes the analysis a bit harder, because the magnitudes 
of the added vertices correlate with M and thus with the polytope. 

To prove Theorem 16. 1^ let us recall how many calls Solver for (LP) makes to the 
polar shadow- vertex method. One call is made to solve (Int LP) (in the second phase), and 
several calls are made to solve (Unit LP+) (in the subroutine Solver for (Unit LP) in 
the first phase.) 

The expected number of calls (iterations) in Solver for (Unit LP) is 4. This follows 
from part 2 of Theorem 15.41 and Proposition 15. 3i Thus: 

The running time of SOLVER FOR (LP) is bounded by the total number of 
pivot steps made in the polar shadow-vertex simplex method, when we apply 
it: 

(1) once for (Int LP); 

(2) on average, four times for (Unit LP^). 

Furthermore, as explained in Section 12.21 the number of pivot steps in the polar shadow- 
vertex simplex method on a unit linear program is bounded by the number of edges of the 
polygon P f] E, where P is the convex hull of the origin and the constraint vectors, and E 
is the span of the initial and the actual objective vectors. 

We shall now first estimate the size of the section of the polytope for (1) and (2) separately, 
and then combine them by a simple stopping time argument. 

Recall that the vectors a^ and b are Gaussian random vectors with centers a, and b 
respectively, and with standard deviation a. We can assume without loss of generality that 

||(ai,6i)||<l foralH = l,...,n, a< J. (6.1) 

(To achieve these bounds, we scale down the vectors if necessary - first to achieve maxj ||(aj, 6j^ 
1, then further to make a as required). 

6.1. Sections of random polytopes. When we apply the polar shadow-vertex simplex 
method for (Int LP) (in phase-II of SOLVER FOR (LP)), the plane 

£; = span((z,0),(0, 1)) 

is fixed, and the constraint vectors are 

(0,1), (0,-cx)), and(ai,l-6i)r=i- 
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The vertices (0, 1), (0, — oo) and the origin can be removed from the definition of P using 
the elementary observation that \i a ^ E then the number of edges of conv(P U a) n i? is at 
most the number of edges oi Pr\E pkis 2. Since (0, 1), (0, — oo) and do he in E, they can 
be ignored at the cost of increasing the number of edges by 6. Thus we can assume that 

P = conv(aj,l -6i)"=i, 

where the vectors (a^, 1 — ftj) are independent Gaussian vectors with centers of norm at most 
2, and with standard deviation a. 

Scahng these vectors down so that their norms become at most 1, we deduce the desired 
size of the section P D E from the following theorem. It gives a desired bound for the 
number of pivots when we solve (Int LP). 

Theorem 6.2 (Sections of random polytopes). Let ai,...,a„ be independent Gaussian 
vectors in M with centers of norm at most 1, and whose standard deviation a satisfies 
(j6.ip . Let E be a plane in W^. Then the random polytope P = conv(ai, . . . , a^) satisfies 

E I edges(P r\ E)\ < Cd^a'^, (6.2) 

where C is an absolute constant. 

Proof. See Section [71 ■ 

Remark. Spielman and Teng obtained a weaker estimate 0{nd^a~^) for this size ([6] 
Theorem 4.0.1). Because of the polynomial, rather than a polylogarithmic, dependence on 
n, their bound is not sufficient for us. 

Summarizing, we have shown that: 

(Int LP) makes in expectation D{d,a) + 6 pivot steps, (6-3) 

where D(d, a) denotes the right hand side in (j6.2p . 

6.2. Sections of a random polytope with an added facet. When we repeatedly apply 
the polar shadow-vertex simplex method for (Unit LP+) (in Solver for (Unit LP)), 
each time we do so with U chosen randomly and independently of everything else. Let us 
condition on a choice of U. Then the plane E is fixed: 

E = span(-r — -, z) = span{UzQ, z). 
Il^oll 

The constraint vectors are 

0-1, • • • ) O-n, fln+l; • • • ) 0-n+d- 

The first n of these are independent Gaussian vectors with centers of norm at most 1 and 
whose standard deviation a satisfies (16. ID. 
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The last d are also Gaussian vectors chosen independently with centers 2Moaj and vari- 
ance 2Mo<Ti, where ai{= Ua'j) are fixed vectors of norm 



la.ll = A/ llz' ||2 + 114 - a'P = y/lTP < 1.01. 



(Here we used the orthogonality of z'q and Zq — a^, which holds by the construction of these 
vectors in algorithm Adding Constraints). 

Recall that Mq = e"°^^'^^, where M = maxj=i^...^„ ||ai|| and ui is as in (|5.1|) . 

Let $(«!, . . . jOn+d) denote the density of such vectors as above. One should note that 
the last d of vectors correlate with the first n vectors through the random variable Mq. This 
difficulty will be resolved by an argument similar to that in [6]. We will show that with 
high probability, Mq takes values in a small finite set. For each fixed Mq in this set, all the 
vectors oi, . . . ,an+d are independent, so we can use Theorem 16.21 to get the desired size of 
the section P f] E: 

Corollary 6.3 (Sections of random polytopes with an added facet). Let ai,... ^an+d be 
random vectors in W^ with joint distribution ^{ai, . . . , an+d)- Let E be a plane in M'^. Then 
the random polytope P = conv(ai, . . . , an+d) satisfies 

E|edges(Pn£;)| < Cloglogn • Z?(d,cro) 

where D{d,a) denotes the right hand side of ()6.2p . where 

(To = clog~^" n ■ min(cr, (Ti), 

and where c is an absolute constant. 

Proof. See Section El ■ 

Thus, similarly to the previous section, we have shown that: 

(Unit LP^ ) makes in expectation Clog log n • D{d, (Tq) + 6 pivot steps. (6.4) 

6.3. The total number of pivot steps. As we mentioned in the beginning of this section, 
the total number of 

pivot steps in SOLVER FOR (LP) = Y + Z, (6.5) 

where Y is the number of pivot steps to solve (Int LP), and Z is the total number of pivot 
steps to solve (Unit LP+) over all iterations Solver for (Unit LP) makes. 
Then (j6.3p states that 

WY <D{d,a)+Q. (6.6) 

Furthermore, the expected number of iterations in Solver for (Unit LP) is at most four. 
Then ()6.4p yields a good bound for Z. This is rigorously proved by the following simple 
stopping time argument. 
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Consider a variant of Solver for (Unit LP), from which the stopping condition is 
removed, i. e. which repeatedly apphes phase-I and phase-II in an infinite loop. Let Z^ 
denote the number of pivot steps in phase-II of this algorithm in k-th iteration, and F^ 
denote the random variable which is 1 if A;-th iteration in this algorithm results in failure, 
and otherwise. Then the expected total number of pivot steps made in the actual Solver 
FOR (Unit LP), over all iterations, is distributed identically with 

k=i j=i 
To bound the expectation of Z, we denote by Eq the expectation with respect to random 
(smoothed) vectors (ai,...,a„), and by Ej the expectation with respect to the random 
choice made in j-th iteration of Solver for (Unit LP), i. e. the choice of U and of 

[O-n+l, . . . , CLn+d)- 

Let us first condition on the choice of (ai, . . . , a^)- This fixes the numb set, which makes 
each Fj depend only on the random choice made in j'-th iteration, while Zk will only depend 
on the random choice made in k-th iteration. Therefore 

oo fc— 1 

EZ = Eo J^(EfcZfc) J] E,Fj. (6.7) 

k=i j=i 

As observed above, 

EjFj = F{Fj = 1) < 3/4, 

which bounds the product in (j6.7p by (3/4) . Moreover, EoEfcZ^ are equal to the same 
value for all k because of the identical distribution. Thus 

EoEfcZfc < maxE$Zi, 

where £'$ is the expectation with respect to the random vectors (ai, . . . , an+d) conditioned 
on a choice of U in the first iteration. Thus 

EZ< 4maxE$Zi. 

The random vectors (ai, . . . , a^+d) have joint density <I>, and we can apply ()6.4p to bound 

maxE^Zi < 2Cloglogn • I?((i, o"o). 

Summarizing, 

EZ < 8Clog\ogn-D{d,ao). 

This, (j6.6p . and ()6.5p imply that the expected number of pivot steps in Solver for (LP) 
is at most 

D{d, 0-) + 6 + SCloglogn • D{d, ao) < Ci log^ n • log log n • d^ia'^ + d^ log^ n + d^ log^ d). 
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This proves Theorem 16.11 and completes the smoothed analysis of the simplex method. 

7. Sections of random polytopes with i.i.d. vertices 

The goal of this section is to prove Theorem 16.21 about the size of planar sections of 
random polytopes. 

Our argument improves upon the part of the argument of [6j where it looses a factor of 
n. Recall that we need a poly logarithmic dependence on n. 

In Section 17. H we will outline the counting argument of [6] , the crucial idea of which is to 
reduce the counting problem to the geometric problem of bounding a fixed point away from 
the boundary of a random simplex. In Section [721 we will give a simple "three viewpoints" 
argument that allows us to not loose a factor of n in contrast to the original argument of 
[6]. In further sections, we revisit the counting argument and complete the proof. 

7.1. Spielman-Teng's counting argument. We start by outlining the counting argu- 
ment of [.6j . This argument leads to a loss of a factor linear in n. We shall show then how 
to improve this to a polylogarithmic factor. 

We consider the one-dimensional torus in the plane E defined as T = £■ n S . We 
parametrize it by 

q = q{9) = zsm{e)+tcos{e), $£ [0,27r), 

where z, t are some fixed orthonormal vectors in E. 

Assume for now that the origin always lies in the interior of the polytope P; we will get 
rid of this minor assumption later. Then clearly 

Exp:=E|edges(Pn£;)| = E|{facetp(g) : q£T}\. 

In order to turn this into finite counting, we quantize the torus T by considering 

Tm '■= {'m equispaced points in T}. 

A simple discretization argument (see [6] Lemma 4.0.6) gives that 

E|{facetp(g) : q e T}\ = lim E|{facetp(g) : q G T„}| (7.1) 
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Facetpf^j 




P 

For the proper counting, one would prefer to keep one point q per facet, for example the 
one closest to the boundary of the facet. The closeness here is measured with respect to 
the angular distance: 



Definition 7.1 (Angular distance). Let x and y be two vectors in W^. The angular distance 

ang(x, y) := cos" 



-1 / \{x,y)\ 



\x\\\\y\ 

is the angle formed by the vectors x and y. The angular length ang(X) of an interval I in 
W^ is the angular distance between its endpoints. 

So in the counting formula (j7.ip . we can leave one q per facet, namely the q = q{6) 
with the maximal 9 (according to the parametrization of the torus in the beginning of the 
argument). Therefore, the angular distance of such q to the boundary of Facet pnE('?) (one 
of the endpoints of this interval), and hence also to the boundary of Facetp(g), is at most 
2Ti/m. 

We have proved that 

27r 
Exp < lim E|{facetp(g) : ang (g, 9Facetp(g)) < — , q € T.m}|. 

m— ►oo 771 

With a slight abuse of notation, we shall denote the set of all d-subsets of {1, . . . , n} by (]]) . 
Then 

Exp< lim V V P{facetp(g) =/and ang(g,aA(ai)ig/) < — }. (7.2) 

geT„,e(^) 

For every q, there exists at most one / € (^) such that facetp(g) = I. Hence 

Y^ P{facetp(g) = /} < 1 
^e(^) 
Using this bound in (j7.2p and estimating XlneTm. above by m ■ maxggf^ , we obtain 

Exp < lim m-p{m), (7.3) 
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where 

27r 
p(m) = max F{ang (q, d AiaAi^j) < — I facetp(g) = /}. 
geT™,/G(:;) ' m ' 

Furthermore, one can get rid of the polytope P by analyzing the event facetp(g) = /. 
We condition on the reahzation on the points {ai)i^i':, as weU as on the subspace Ej := 
span(aj)jg/. The randomness remains in the random points ai, i (z I inside the (fixed) 
subspace Ej. Then the event facetp(q) = I holds if and only if the (fixed) point qp where 
the direction of q pierces Ej lies in the the simplex A(aj)jg7. 

We have thus reduced the problem to estimating the distance of a fixed point to the 
boundary of a random simplex, conditioned on the event that the point lies in that simplex. 

The main difficulty is that the distance is angular rather than Euclidean; the latter is 
easier to estimate. Unfortunately, the two distances may be very different. This happens 
if the angle of incidence - the angle at which the direction of q meets the subspace Ej - is 
too small. So Spielman and Teng needed to show that the angle of incidence is at least of 
order 1/n with high probability (Section 4.2 in [6j); consequently, the angular and Euclidean 
distances are within a factor 0{n) from each other. 

In this paper, we can not tolerate the loss of a factor of n since we are proving a complexity 
estimate that is poly logarithmic rather than polynomial in n. We will now present a simple 
way to avoid such a loss. 



7.2. Three vie'wpoints. Instead of estimating the angle of incidence from one viewpoint 
determined by the origin 0, we will view the polytope Pq from three different points Oi, O2, 
O3 on E. Vectors q will be emitted from each of these points, and from at least one of them 
the angle of incidence will be good (more precisely, the angle of q to the intersection of its 
facet with E will be good). This is formalized in the following two elementary observations 
on the plane. 

Lemma 7.2 (Three viewpoints). Let K = conv{bi, ... ,b]\f) be a planar polygon, where 
points bi are in general position and have norms at most 1. Let Oi, O2, O3 be the vertices of 
an equilateral triangle centered at the origin and with norm 4. Denote Ki = conv(Oj,-fC). 
Then, for every edge {bk,bm) of K, there exists i £ {1,2,3} such that {bk,bfn) is an edge of 
Ki, and 

dist(0i,aff(6fc,&m)) > 1- 



21 




Proof. Let L be any line passing through the origin. Then, for every equilateral triangle 
centered at the origin and whose vertices have norms R, there exist two vertices separated 
by the line L and whose distances to L are at least R/2. (The bound R/2 is attained if L 
is parallel to one of the sides of the triangle) . 

Let L be the line passing through the origin and parallel to the edge (6^, bm)- It follows 
that among the three points Oi, O2, O3 there exists at least two (say, Oi and O2) separated 
by the line L and such that 

dist(Oi,L) >4/2 = 2, i = l,2. 

Moreover, since all the points bi have norms at most 1, we have 

dist(L,aff(6fe,6„)) < dist(0,5fc) < 1- 
Then by the triangle inequality, Oi and O2 are separated by the line aff (6^, bm) and 

dist(Oi, aff (6fe, bm)) > 1, i = 1, 2. 

Since Oi and O2 are separated by the affine span of the edge (b^, bm) of the polygon K, 
one of these points (say, Oi) lies on the same side from aS{bk,bm) as the polygon K. It 
follows that {bk,bm) is an edge of conv(Oi,i^). This completes the proof. ■ 

Lemma 7.3 (Angular and Euclidean distances). Let L be a line in the plane such that 

dist(0,L) > 1. 
Then, for every pair of points xi, X2 on L of norm at most 10, one has 

cdist(a;i,X2) < ang(xi,X2) < dist(xi,X2) 
where c= (10^ + 1)"^ 

Proof. Without loss of generality, we may assume that dist(0, L) = 1. Choose unit 
vectors u and v in the plane such that {u, v) = and L = {u + tv : t G M}. Then 

Xl = U + tlV, X2 = U + t2V 



22 



ROMAN VERSHYNIN 



for some ti,t2 G M. 




Hence we have 

\tl\ < \\xi\\ < 10, \t2\ < \\X2\\ < 10. 

Without loss of generahty, ti < ^2- Then one easily checks that 

dist(xi,X2) = t2 — ti, ang(xi, X2) = tan""*^ ^2 — tan^"*^ ti. 
Thus 



ang(xi,a;2) 



t2 



di 



(tan"^ t) dt 



t2 



dt 



l + t2- 



We can estimate the integrand from both sides using the inequalities l<l + t'^<l + t2< 

1 + 102. Thus 

t2-ti 



1 + 102 



< ang(xi,a;2) < t2 — ii- 



This completes the proof. 



In view of Lemma 17.31 we can rephrase Lemma 17.21 as follows: every edge (facet) of K 
can be viewed from one of the three viewpoints Oi, O2, O3 at a nontrivial angle, and yet 
remain an edge of the corresponding polygon conv(Oi, K). 



7.3. Boundedness of the random polytope. In Lemmas 17.21 and 17.31 the boundedness 
requirements (for the points Xi and bi respectively) are clearly essential. To make sure that 
these requirements are satisfied in our setting, we recall that the vertices of the polytope 
P are i.i.d. Gaussian vectors. We shall therefore use the following well-known estimate on 
the size of a Gaussian vector, see e.g. Proposition 2.4.7 in [6]. 

Lemma 7.4. Let g be a Gaussian vector in M" (n > 3) with center g of norm at most 1, 
and with variance a. Then: 

1. We have ¥{\\g - g\\ > 'ia^/dAogn} < n"2-M. 

2. We have Pjll^ll < caVd} < e~'^, where c = e~^/^. 



We consider the event 



£ :-- 



ai 



< 2, i = 1, . . . ,n}. 
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By Lemma 17.41 and using our assumption ()6.ip , we have 

P{£"=} <F{ max lloill >3CTV'dk^+l} < n"2.M+i < QQQj5|M ^^^^ 

i=l,...,n \"'J 

Our goal in Theorem 16.21 is to estimate 

Exp:=E|edges(Pn£;)|. 

The random variable | edges(Pn£')| is bounded above by Q) , which is the maximal number 
of facets of P. Using (j7.4p . it follows that 

Exp := E I edges(P nE)\<E\ edges(P nE)\-l£ + l. 

We will use a similar intersection argument several times in the sequel. 

7.4. Counting argument revisited from three viewpoints. We will apply Lemma [7.2l 

in combination with 17.3] for the random polygon P D E, whenever it satisfies £. All of the 
points of this polygon are then bounded by 2 in norm, so we scale the result in Lemma 17.21 
by the factor 2. 

Let Oi, O2, O3 be the vertices of an equilateral triangle in the plane E, centered at the 
origin and with 

||0i|| = ||02|| = 110311 = 8. (7.5) 

Denote 

Pi = conv(0,-Oi + P). (7.6) 

Lemma 17.21 states in particular that, if £ holds, then each edge ol P D E can be seen as 
an edge from one of the three viewpoints. Precisely, there is a one-to-one correspondencqj 
between the edges of P n £' and the set {facet p.n_B('?) : Q' G T, i = 1, 2, 3}. We can further 
replace this set by {facet p^ (q) : q G T, i = 1,2, 3}, since each facet p^ (q) uniquely determines 
the edge facetp.n£;('?); and vice versa, each edge can belong to a unique facet. Therefore 

Exp < E|{facetp^(g) : q eJ, i = 1,2,3}\ + 1 (7.7) 

and, by a discretization in limit as in Section 17. H 

= lim E|{facetp^(g) : g g T^, z = 1,2,3}| + L (7.8) 

m— >oo 

Moreover, by the same discretization argument, we may ignore in (|7.7p all facets whose 
intersection with E have angular length no bigger than, say, 27: /m. After this, we replace 



Here and in the sequel, we identify facet (g) with the index set it contains. Since the polytope in question 
is almost surely in general position, facet (5) contains at most one index set. 
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Pi hj PiCi E as we mentioned above, and intersect with the event £ again, as before. This 

gives 

27r 
Exp < hm E |{facetp,ni=;(g) of angular length > — ; g E T„, i = 1, 2, 3}| • If + 2. (7.9) 

We are going to apply Lemmas 17.21 and 17.31 for a realization of P for which the event £ 
holds. Consider any facet from the set in (j7.9p . So let 

/ = facet p-n£;(?) for some i G {1, 2, 3} and some q G Tm- 

By Lemma [7. 2| we can choose a viewpoint Oj which realizes this facet and from which its 
intersection with E is seen at a good angle. Formally, among the indices io S {1,2,3} such 
that / = facetp- nEiqo) for some go G Tm, we choose the one that maximizes the distance 
from to the affine span of the edge 

T = Facet p.^nsU) • 

By Lemma 17.21 

dist(0,aff(J)) > L (7.10) 

Because only facets of angular length > 2Tr/m were included in the set in (j7.9p . we have 
ang(T) > 27r/m. It follows that I contains some point q" in T^- 

Summarizing, we have realized every facet / = facet p^n£(^) from (17.9P as / = facetp. ^eW) 
for some Iq and some q" G ¥„. 

Recall that when the event £ holds, all points of P have norm at most 2. Thus all points 
of Pjp have norm at most ||Oip|| + 2 = 10. Since Z C Pig, all points in Z also have norm 
at most 10. Therefore bound (j7.10p yields, by view of Lemma 17.31 that the angular and 
Euclidean distances are equivalent on Z up to a factor of c. 

We shall call a facet (edge) of a polygon nondegenerate if the angular and Euclidean 
distances are equivalent on it up to the factor c. We have shown that 

Exp < lim E I {nondegenerate facetp,n_E(g) : q G Tm, i = 1,2, 3}| • l^- + 2. 

m—>oo 

Each facet may correspond to more than one q. We are going to leave only one q per facet, 
namely the q = q{6) with the maximal 6 (according to the parametrization of the torus in 
the beginning of the argument). Therefore, the angular distance of such q to one boundary 
of Facet p-n_E('7) (one of the endpoints of this interval) is at most 271 /m. The nondegeneracy 
of this facet then implies that the usual distance of gp. to the boundary of Facetp.n£;(g), 
thus also to the boundary of Facetp. (g), is at most - • 27r/m, =: C /m. Therefore 

C 

Exp < lim E I {facetp^ ((7) such that dist(gp^, 9Facetp^(g)) < — , Q G T^, i = 1, 2,3}| • If + 2 

m— >oo fn 

C 

< 3 max lim E [{facetp, (g) such that dist(gp,, 5 Facetp. (g)) < — , q G T,m}| • If + 2. 

1=1,2,3 m—>oo m 
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Recall that by ()7.6p and ()7.5p . the polytope Pk is the convex hull of the origin and a 
translate of the polytope P by a fixed vector of norm 8. Therefore 

Exp < 3maxExpo+2, (7.11) 

where 

(J 
Expo = lim K\{facetp(q) such that distfgp, 9Facetp((7)) < — , q £ Tm}\ • If (7.12) 

m— >oo 771 

and the maximum in (|7.11|) is over all centers of the distributions Oj of norm at most 
1 + 8 = 9. Scaling the vectors down by 9, we can bound ExpQ using the following lemma: 

Lemma 7.5 (Discretized counting). Let ai, . . . , On be independent Gaussian vectors in M 
with centers of norm at most 1, and whose standard deviation a satisfies (|6.ip . Then 

Expo < CodV-^ 

where Cq is an absolute constant. 

This lemma and (j7.1ip complete the proof of Theorem 16.21 

7.5. Proof of Lemma 17.51 As before, (^) will denote the set of all d-subsets of {1, ... , n}. 
We have 

Expo = lim V V P{facetp(g) = / and dist( qp, d A(ai)iei) < — and £}. (7.13) 

m^oo ^ — ' ^ — ' rn 

For every q, there exists at most one / G (^ such that facetp(gf) = /. Furthermore, the 
last equation is equivalent to qp £ A(aj)j(=7. Hence 

^ P{facetp(g) = /} = ^ F{qp G A(a,)ie/} < 1- 
H",) Hi) 

Using this bound in (j7.13p and estimating J2q<=Tm ^-bove by m ■ max^gT,^ , we obtain 

ExpQ < lim m-pQ{m), (7.14) 

m— »oo 

where 

Po{m) = max P{dist((7p, 9 A(aj)jg/) < C/m and £ \ qp £ A(aj)jg/}. 
geT„,/e(^) 

Thus we should be looking for an estimate of the type poiin) < 1/m. 

To this end, we fix an arbitrary point q € T™ and a set I £ (^) . Consider the hyperplane 

Ej := span(aj)je/ 

and the point 

qj := point where the direction of q pierces the hyperplane Ej. 
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Then 

facetp{q)=I^ { ^ '^ (7.15) 

I all vectors {ai)i^ic are below Ei. 

We now pass to the local coordinates {bi)i£i for the hyperplane Ej, using the change of 
variables 

Qi = Rujbi + ruj, i e I, 

described in Appendix [Q 

We condition on a realization of r, lv and the vectors {ai)i,zjc. This fixes the hyperplane 
Ej and the point qj determined by r and uj. The density of the vectors {bi)i(zj is given in 
Lemma rC. II in Appendix O By (I7.15|) . we can assume that the vectors {ai)i^ic are below 
Ei. 

Let p G R"'"^ be the (fixed) representation of qi in the new variables, i.e. 

qi = RujP + ruj. 

Consider the event 

£o-={\M <2, i = l,...,n} 
By part 1 of Lemma IC.H 

BydUSI), 

facetp(g) = I -^qj £ ^{ai)iei -^ p G A(6i)ig/. 
Moreover, if p G A(6j)ig/ then ||p|| < maxjg/ ||6j||, thus 

{£ and facetp(g) = 1} Q {So and p G A(6i)ie/} ^ {\\p\\ < 2}. 

Summarizing, we have shown that 

Po("i) < maxP{dist(p, 9 A(6j)j£/) < C/m and f o | P G A(6j)jg/}, 

where the maximum is over all vectors p G M such that ||p|| < 2. We may assume that 
p = by translating all the vectors by —p. Before this translation, the densities i^j that 
make up the density of {bi)i£i in ()C.2p had centers of norm at most 1 by Lemma IC.li After 
the translation, their norms will be at most 1 + ||p|| < 3. Similarly, the constant 2 in the 
definition of Sq will change to 2 + ||p|| < 4. 

It remains to use the Distance Lemma 4.1.2 of [6]: 

Lemma 7.6 (Distance Lemma (Spielman-Teng)). Let vi,...,Vd be densities of Gaussian 
vectors in W^^^ with centers of norm at most 3 and with standard deviation a < l/3\/dlogn. 
Then, for every e > 0, we have 

P{dist(0,aff(62,...,&d)) <e and all \\b^\\ < 4} < CidV^^e, 
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where (61, ... , bd) have joint density proportional to 

d 

\A{h,...,bd)\■l[ly^{b,), 

i=l 

and Ci is an absolute constant. 

Note that dist(0, 3 A(6j)jg/) < C/m implies that there exists io £ I such that 

dist(0,aff(6i)ig/„{j(,}) < C/m. 
Since there are d choices for ig, the Distance Lemma 17.61 yields: 

Po{m) <d-Cid^a-^{C/m) < C2d^a-^/m, 
where C2 = CiC . By ()7.14p . we conclude that 

Expo = 0{d^a-^). 
This completes the proof. ■ 

8. Sections of random polytopes with an added facet 

In this section, we prove Corollary 16.31 about the size of planar sections of random poly- 
topes with an added facet. 

The main difficulty is that not all vectors {ai, . . . ^On+d) are independent. The last d 
vectors correlate with the first n vectors through the random variable Mq = e"°^^^^ , where 
M = maxj=i,..._„ ||aj||. 

This difficulty will be resolved similarly to ^. We will show that with high probability, 
Mq takes values in a set of cardinality O(loglogn). For each fixed Mq in this set, all the 
vectors oi , . . . , a^+d are independent, so we will be able to use Theorem 16.21 to get the 
desired size of the section P f] E. 

8.1. Boundedness of the random poly tope. Consider 

M = max ||aj||, M = max ||aj||. 

i=l,...,ra i=l,...,n 

Recall that by ([61]), M < 1. 
Lemma 8.1. Consider the event 

£1 := \—^^(M + aJdlogn) <M<M + 3aJdlogn], (8.1) 

•- V log n ) 

where ci = c/9, and where c is the absolute constant in Lemma \7.4\ Then 

•m < (Y " 
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Proof. Upper bound. By part 1 of Lemma 17.4^ 

P{|M - M\> 3(7y/dlogn} < { max \\ai - di\\ > Sa^/dlogn} < n~2-9'^+i < O.OOlsf '^ ) . 

i=l,...,n V"/ 

Lower bound. We consider two cases. If M > 8a^/dlogn then, using the last estimate, 
we have: 

1 - - /n^ ^^ 

P{M < -(M + aVdlogn)} < P{M < M - SaVdlogn} < 0.0015 

If M < Scj-v/filogn then, using part 2 of Lemma 17.41 and the independence of Oj, we obtain 

PJM < ^^^(M + aJdlogn)] < P{M < 9cia\/d} < (e"'^)" < 0.5( "" ) . 
Vlogn \dj 

This completes the proof. ■ 

8.2. Discretization of M. Recall that Mq = e^^°^^^\ It follows from LemmaOthat Mq 
is likely to take values in a small set. 

Indeed, if £i holds, then Mq takes values in a (non-random) set 

Mo:= {6^'°*^*^^ : M satisfies the inequalities in (f8T]) } 

of cardinality |7Wo| = O (log log n). 

Therefore, using as before that the number of edges of the polytope P O E is bounded 
by (^) , we have: 

E| edges(P nE)\<E\ edges(P n E)\ ■ Is, + I 

= ^ E| edges(P nE)\- l{Mo=m} + 1 

m&Mo 

<Cloglogn- max Eo| edges(P n P)| + 1, (8.2) 

Mo&Mo 

where Eq denotes the expectation with respect to the distribution of (ai, . . . , a„+d) as in 
the statement of Corollary 16.31 except that the value of Mq £ Mq is fixed. According to this 
new distribution, all vectors oi, . . . , On+d are now independent. 

8.3. Vertices with different standard deviations. Now we dilate the vectors Oj so that 
the centers of their distributions become at most 1. To this end, define 

bi := , i=l,...,n + d, 

C'MoVl^ 

where C = 3/ci, where ci is the constant in Lemma 18.11 Let us estimate the standard 
deviations and the centers of the distributions of 6,- . 
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Vectors 61, . . . , 6„. The standard deviation of ai, . . . , a„ is a. Hence the standard 
deviation of 61 , . . . , 6„ is 



/ 



a 



Using the lower bound in (18. ip (which holds by the definition of TWq) and the trivial in- 
equality M < Mq , we conclude that the standard deviation satisfies a condition of the type 

(|6.ip . namely 

, a a 1 , , 

a' < ; < , < — , (8.3) 

C'Mvlogn C'cicrvd log n Sydlogn 



On the other hand, since M < 1 and by the upper bound in (jS.ip . we have Mg < 3M < 



3M + 90-Vdlogn < 3 + 9/6 < 5. Thus 

a' > " (8.4) 

oC'Vlogn 

The centers of the distributions of ai, . . . ,a„ have norms at most M. Since M < Mq and 

using the lower bound in (jS.ip , we conclude that the centers of the distributions of 61 , . . . , 6„ 

have norms at most 

M M 

C'MoVlS^ ~ ciC'Mo - 

Vectors bn+i, ■ ■ ■ , bn+d- The standard deviation of a^+i, . . . , fln+d is 2MoO"i. Hence the 

standard deviation of 6„+i, . . . , bn+d is 

;/ 2(71 

a" := ' (8.5) 

Since o"i satisfies (j5.ip and C" > 1, the standard deviation cj" satisfies a condition of the 
type ()6.ip . namely 

a" < J (8.6) 

oywlogn 

The centers of the distributions of an+i, ■ ■ ■ ,an+d have norms at most 3Mq. Hence the 

centers of the distributions of bn+i, ■ ■ ■ , bn+d have norms at most 

Cylogn 
Now we apply Lemma 4.3.2 from [6] which allows us to reduce standard deviations of the 
vertices to one (minimum) value. 

Lemma 8.2 (Gaussians free). Let di,...,dn be independent Gaussian vectors in W^ with 
standard deviations cii, . . . , cr„. Let ctq > 0. AssuTne that ctq < ai < l/3\/dlogn for all i. 
Let E be a plane in M . Then the random polytope P = conv(di, . . . ,dn) satisfies 

E|edges(PnS)| < L>(d,<7o) + 1. 

where D{d,a) denotes the right hand side of (|6.2p . 
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We use this lemma for the vectors 61 , . . . , bn+d and for 

/ (7 2(Tl \ 1 -1/2/ \ / \ 

0-0 := mm I , ) > clog ' [n) ■ mm{a,ai), 

\5C'^/[ogn C'^^logn/ 

where c > is some absolute constant. 

Indeed, it follows from ([Ml) and ([83]) that ctq < min(o-', a"). Similarly, ([83]) and dM 
state that inayi{a' ,a") < l/3^/dlogn. Thus Lemma [52] applies, and it yields 

Eol edges(P n E)\ < D{d, ao) + 1. 

Using this in (|8.2|) . we conclude that 

E|edges(Pn£;)| < Cloglogn • D(d,cro) + 1. 

This completes the proof of Corollary 



A. Appendix A. Proof of Proposition 14.11 

In this section, we prove Proposition 14.11 which allows us to interpolate between an 
arbitrary linear program and a unit linear program. 

For a fixed t £ [0, 1], let (LPj) denote the interpolation program (Int LP) with this fixed 
value of t. The feasible sets (polytopes) of (LP) and of (LPj) will be denoted by P and Pt 
respectively. They are subsets of M.'^. 

A.l. Recession cone. Our proof of (i) of Proposition 14.11 will be based on an analysis of 
the recession cone, defined as 

Recess(LP) = Recess(P) := {x : Mx + P<^P for ah M > 0}. 

The polytope P is unbounded iff its recession cone is nonempty. 

Lemma A.l (Recession cone). Assume (LP) is feasible. Then 

Recess(LP) = {x : ^x < 0} = (cone(ai)"^i)° 

Proof. The second equation is trivial. 

To prove the inclusion Recess(LP) C {x : Ax < 0}, let us fix arbitrary x S Recess(P) 
and xq £ P. By the definition of the recession cone, Mx + xq £ P for all M > 0. Since P 
is the feasible polytope of (LP), we have 

A{Mx + xo) <b for ah M > 0. 

Hence 



Ax < —{b - Axo) ^ as M ^ 00. 



It follows that Ax <0. 
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To prove the inclusion Recess(LP) D {x : Ax < 0}, let us fix x such that Ax < and 
xq £ P. Since P is the feasible polytope of (LP), we have Axq < b. Then for every M > 
we have 

A{Mx + xo) = M ■ Ax + Axo < b. 

Thus Mx + xq (z P for every M > 0. Hence x € Recess(P). This completes the proof. ■ 

Lemma A. 2 (Boundedness of LP). Assume the linear program (LP) is feasible. Then (LP) 
is bounded iff 

z G (Recess(LP)) = cone(aj)"=;^. 

Proof. Necessity. Assume 2; ( Recess (LP)) . Then there exists x € Recess (LP) such 
that {z,x) > 1. Let us fix an arbitrary xq £ P. By the definition of the recession cone, 
Mx + Xq £ P for every M > 0. Therefore, the vectors Mx + xq are feasible for (LP). On 
the other hand, the objective function can take arbitrarily large values on such vectors: 

{z, Mx + Xq) = M{z, x) + {z, Xq) — > oo as M — > oo. 

Thus (LP) is unbounded. 

Sufficiency. Assume z € cone(aj)"^^. Then we can write z = Yll=i ^i'^i for some Aj > 0. 
Let X be any feasible vector for (LP), that is Ax < b. Then, the objective function is 
bounded as 

n n 

{z,x) < ^Ai(ai,x) < y^^Xjbj. 

1=1 i=l 

Thus (LP) is bounded. This completes the proof. ■ 

Remark. The main point in both lemmas is that their conclusion does not depend on 
the right hand side b of (LP). Thus, for feasible linear programs, their boundedness does not 
depend on the right hand side b. 

A. 2. Proof of Proposition 14.11 



Proof of (i). Since unbounded linear programs are feasible. Lemma lA. 21 implies that (LP) 
is unbounded iff (Unit LP) is unbounded. 

Next, if (Unit LP) is unbounded then (Int LP) is clearly unbounded for every A. Indeed, 
the feasible set of (Int LP) contains the Pq as a section (for t = 0), and Pq is the feasible 
set of (Unit LP). 

The remaining part of (i) is: if (Int LP) is unbounded for some A, then (Unit LP) is 
unbounded. We shall now prove this. 

The unboundedness of (Int LP) means that for every M >\ there exists a feasible point 
[xMitM) for (Int LP) such that 

{z,xm) + Atjv/ ^00 as M — > 00. (A.l) 
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Since tM £ [0, 1], the definition of (Int LP) yields 

AxM < tub + (1 - tAf )1 < max(||&||oo, 1) • 1 =: S • 1. (A.2) 

Writing this estimate as Ai^^xu) < 1, we see that the vectors ^xm are feasible for 
(Unit LP) for all M > 1. On the other hand, the objective function can take arbitrar- 
ily large values on such vectors. Indeed, since tM G [0, 1], estimate (jA.ip implies that 

(z, —xm) — > CO as M ^^ oo. 
B 

Thus (Unit LP) is unbounded. 

Proof of (iii). Assume that (LP) is not unbounded. 

Sufficiency. Assume (LP) is feasible. Then the interpolation (LPj) is clearly feasible for 
every < t < 1. Indeed, if x is feasible for (LP), then tx is feasible for (LPj). Similarly 
to ()A.2p . we have that Ax < B ■ \ h all feasible points x of (LPj). In terms of the feasible 
polytopes, this means that Pt '^ B ■ Pq. 

Since (LP) is not unbounded by the assumption, part (i) implies that (Unit LP) is not 
unbounded. Being always feasible (0 is a feasible set), (Unit LP) must therefore be bounded. 
Hence 

max(z, x)\ < B ■ \ max(z, x) =: Xq. (A. 3) 

xePt xePo 

We can write (Int LP) as the optimization problem 

max f(t), where f(t) =m.ax — (z,x) +t. 
te[o,i]^ ^ ^ ^ xePt \^ ' 



By (|A.3[) . we have 



It then follows that 



|/(t)-t| < Ao/A for ah iG [0,1]. 



argmax/(t) > 1 — 2Ao/A, 

tG[0,l] 

which converges to 1 as A — > cxo. 

We have shown that a solution {x\,tx) of (Int LP) with parameter A satisfies t; ^ 1 as 
A — > oo. On the other hand, {xx,tx) is a vertex of the feasible polytope of (Int LP). This 
polytope does not depend on A, thus there are finitely many choices for {xx,tx). It follows 
that ^A = 1 foi" all sufficiently large A. The sufficiency in (iii) is proved. 

Necessity. This part is trivial. Indeed, if (x, 1) is a solution of (Int LP) with some 
parameter A, then x is a solution of (LPi) and thus x is a solution of (LP). 
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Proof of (ii). Assume (LP) is not unbounded. 

Note that for all sufficiently small t, namely for t £ [0, to] with to := l/\\b — l||oo) the 
right hand side of (Int LP) is nonnegative: 

tb+{l-t)l > 0. 

Hence (LPj) is feasible for all t E [0,to] (0 is a feasible point). 

For a fixed A < 0, we can write (Int LP) as the optimization problem 

max g{t), where g{t) = max — -{z,x) — t. 

teDom(g) xi^Pt —A 

The domain T)om.{g) consists of all t £ [0, 1] for which Pt is nonempty, i.e. (LPj) is feasible. 
In particular, [0, to] ^ Doui{g). 
By (jA.Sp . we have 

\g{t) +t\< Ao/(-A) for all t G Dom{g). 

^From this and from the fact that T)oin{g) contains a neighborhood [0, to] of it follows 
that 

argmax g{t) < — 2Ao/A, 

t£Dom(g) 

which converges to as A — > — cxd. 

We have shown that a solution {xx,tx) of (Int LP) with parameter A satisfies t; ^ as 
A — > — oo. Similarly to our proof of (iii), we deduce that tx = for all sufficiently small A. 
This proves (ii). 
Proof of (iv). This follows directly from (iii) since (Int LP) for t = 1 is (LP). 

The proof of Proposition 14.11 is complete. 



B. Appendix B. Proof of Theorem 15.41 

B.l. Part 1. We need to prove that (4a) and (4b) in Adding Constraints imply that 
in the polytope P~^ = conv(0, ai, . . . ,an+d), one has facet(z) = {n+1, . . . ,n+d}. By (4a), it 
will be enough to show that all points oi, . . . , a^ lie below the affine span aff (a^+i, . . . , an+d) = 
H. Since all these points have norm at most M, it will suffice to show that all vectors x of 
norm at most M are below H. By (4b), the normal h to H has norm at most 1/M, thus 
{h,x) < 1. Thus X is indeed below H. This completes the proof. 

B.2. Part 2. Without loss of generality, we can assume that Mq = M. Also, by the 
homogeneity, we can assume that M = 1/2. Thus there is no dilation in step 2 of Adding 
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Constraints. Let if be a numb half-space. It suffices to show that 

F{zo e cone(a„+i, . . . , an+d)} > 0.99; (B.l) 

P{normal h to aff (a„+i, . . . , Un+d) satisfies \\h\\ < — } > 0.99; (B.2) 

P{an+i, . . . , an+d are in H} > 1/3. (B.3) 

Events in (IB.ip and ()B.2|) are invariant under the rotation U. So, in proving these two 
estimates we can assume that U is the identity, which means that zq = z'q and Oj = a[ 
for i = n + 1, . . . , n + d. We can also assume that d is bigger than some suitable absolute 
constant (100 will be enough). 

We will use throughout the proof the known estimate on the spectral norm of random 
matrices. Recall that the spectral norm of a d x ti matrix B is defined as 

WBrW 

II Dll K-'''" 

\\B\\ = max —r, — r— . 



a;eKd ||X|| 

For a random d x d matrix B with independent standard normal random entries, one has 
F{\\B\\ > 2t^} < 2'^{d - i)t'^-2e-'i(i^-i)/2 for t > 1 (B.4) 



see e.g. |14) . Therefore, for a random d x d matrix G with independent entries of mean 
and variance o"i, we have 

P{||G|| > 2ait^} < 2'^{d - i)t'i-2e-'^(i^-i)/2 fo^. ^ > ^ 

(this follows from (fRJl) for B = G/ai). 
In particular, 

P{||G|| < 3(71 Vd} > 0.99. (B.5) 

We will view the vectors a„+i, . . . , an+d as images of some fixed orthonormal vector basis 

Cn+i, . . . , en+d of W'. Denote 

n+d 
i=n+l 

We define the linear operator T in M°' so that 

Oj = Tcj, ai = {T + G)ei, i = n + 1, . . . ,n + d. 

We first show that 

||T"i < 1/1 (B.6) 

Indeed, A(e„4.i, . . . , e^+d) is a simplex with center d~^\ of norm d'^'"^ and radius ||d~^l — 
Cill = y^l — 1/d. Similarly, A(an+i, . . . ,«n+d) is a simplex with center zq of norm 1 and 
radius H^o ~ ^^ill = ^- Therefore we can write T = VTi with a suitable V G 0{d), and 
where T acts as follows: if x = xi + 2:2 with xi E span(l) and X2 G span(l)-'-, then 
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Tix = d^/'^xi +£(1 - l/d)-i/2^2. Thus ||r-i|| = llTf^ll = r^{l - l/d)^/"^ < 1/1 This 
proves (IB.6p . 

B.2.1. Proof of (IB.ip . An equivalent way to state (IB.1|) is that 

n+d 

zq = 2, CjOi where all c, > 0. (B.7) 

i=n+l 

Recall that ai = {T + G)ei. Multiplying both sides of (IBTI) by the operator (T + G)"\ we 
obtain 

n+d 
j=n+l 

Taking the inner product of both sides of this equation with the vectors e^, we can compute 
the coefficients Cj as 

Ci = {{T + G)-^zo,ei). (B.8) 

On the other hand, zq is the center of the simplex A(a„+i, . . . , a„+rf), so zq = Yl^=n+ii^/^)^i- 
Since ai = Tci, a similar argument shows that 

^ = {T~^zo,ei). (B.9) 

Thus to bound Cj below, it suffices to show that the right sides of (jB.SP and (|B.9P are close. 
To this end, we use the identity T'^ - {T + G)'^ = {1+T-^G)-^T-^GT-'^ and the estimate 
||1 + 'S'll ^ (1 — ll'S'll)^^ valid for operators of norm \\S\\ < 1. Thus the inequality 

ll(r + G)--r-||< ;^^^;i['.[jg^^ <l (B.io) 

holds with probability at least 0.99, where the last inequality follows from (jB.Sp . ()B.6P and 
from our choice of £ and o"i made in (j5.ip . Since zq and Cj are unit vectors, (jB.lOp implies 
that the right sides of (jB.Sp and ()B.9p are within ^ from each other. Thus Q > ^ > for 
all i. This completes the proof of (|B.ip . 

B.2.2. Proof of ()B.2p . We claim that the normal z'q to aff(a„+i, . . . ,an+d) a-nd the normal 
h to aff (a„-|-i, . . . , a„4.d) can be computed as 

4 = (r*)-il, h = {{T + Gy)-^l. (B.ll) 

Indeed, for every i G {n + 1, . . . ,n + d} we have 

Hence by the definition of the normal, the vector (T*)~^l is the normal to aff (a^+i, • • • , dn+d)- 
The second identity in (|B.lip is proved in a similar way. 

Since z'q is a unit vector, to bound the norm of h it suffices to estimate 

Wh-z'j < ||((r + G)*)-i-(r*)-i||||i|| = ||(r + G)-i-r-i||||i||. 
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By (IB.IOP and using ||1|| = d~^/^, with probability at least 0.99 one has H/i-a^oll < ^d^^^'^ < 
1. Thus \\h\\ < 2, which completes the proof of (jB.2p . 

B.2.3. Proof of ()B.3p . Let z/ be a unit vector such that the half-space is H = {x : {f^x) > 
0}. Then (|B.3P is equivalent to saying that 



(z/, aj) > 0, i = n+ 1, . . . ,n + d] > 1/3. 

We will write 

(i/, Oj) = {v, zq) + {u, ai - zq) + {v, ai - en) (B.12) 

and estimate each of the three terms separately. 

Since zq is a random vector uniformly distributed on the sphere S'^^^ , a known calculation 
of the measure of a spherical cap (see e.g. [^llj p. 25) implies that 

This takes care of the first term in ()B.12p . 
To bound the second term, we claim that 

p| max |(z^,ai-zo)|< ^1 > 0.99. (B.14) 

To prove this, we shall use the rotation invariance of the random rotation U . Without 
changing its distribution, we can compose U with a further rotation in the hyperplane 
orthogonal to Uz'q. More precisely, U is distributed identically with VW . Here W G 0{d) 
is a random rotation; denote zq := Wzq. Then V is a random rotation in L = span(zo)"'" 
and for which L is an invariant subspace, that is Vzq = zq. 

Then we can write Ui — zq = V£i, where ii := W {a'^ — z'q) = Wa[ — zq. The vectors ii 
are in L because {ii,zo) = {W{a[ — Zq),Wzq) = {a[ — Zq,Zq) = since Zq is a unit vector 
and, moreover, the normal of aff(aj). Since L is an invariant subspace of V, it follows that 
Vii G L. Furthermore, \\£i\\ = \\a[ - Zq\\ = t 

Let Pl denote the orthogonal projection onto L. Then P^v \s a, vector of norm at most 
one, so denoting v' = Plu/WPluW we have 

|(z.,a, -zo)| = \{y,Vii)\ = \{PLV,Vli)\ = \{V*PLy,li)\ < \{V*v'4i)\ 

V*u' is a random vector uniformly distributed on the sphere of -L, and £j are fixed vectors 
in L of norm L 

Then to prove ()B.14p it suffices to show that for x uniformly distributed on S'^^'^ and for 
any fixed vectors £i, . . . , ^^ in M of norm I, one has 

p| max \{xji)\ < ^} > 0.99. (B.15) 
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This is well known as the estimate of the mean width of the simplex. Indeed, for any choice 
of unit vectors /ii, . . . , /i^ in M''^-'^ and any s > 0, 

p{m„|(..,.)|>^}<X:P{l{..'..>l>^} 

and each probability in the right hand side is bounded by p := exp(— (d — 3)^s^/4(i) by the 
concentration of measure on the sphere (see [TT] (l-l))- We apply this for hi = j£i and with 
s = j2ol^ which makes p < Yofj^- This implies (jB.lSh and, ultimately, (|B.14[) . 

To estimate the third term in ()B.12p . we can condition on any choice of U, so that Oj 
become fixed. Then gi = — (z^, Oj — Hi) are independent Gaussian random variables with 
mean and variance ai < — ^ =: s. Then 

1 1 

IP{5i > s} < ^=exp(-sV2cTi) < 



V2^ "^ '^^-lOOd 
by a standard estimate on the Gaussian tail and by our choice of o"i and s. Hence 

^ n+d 

p| min (z/, Oi - a^) > ^) = 1 - y^ Pl^,^ > s} > 0.99. (B.16) 

I i=n+l,...,n+d l20Vd^ .,, 

j=n+l 

Combining (jB.lSp . ()B.14p and ()B.16p . we can now estimate ()B.12p : 

1 1 

¥{{u,ai)>0, i = n + 1, . . . , n + 4 > - - 0.1 - 0.01 - 0.01 > -. 



This completes the proof of Theorem [57 



C. Appendix C. Change of variables 

The following basic change of variables is useful when dealing with the hyperplane E 
spanned by linearly independent ai, . . . , 0,^ in M.'^. It is explained in more detail in [6]. 

We specify E by choosing r G M+ and uj E S in such a way that {uj,ai) = r for all i. 
Thus io is the unit vector in the direction orthogonal to E, and r is the distance from the 
origin to E. 

We choose a reference unit vector h in M'^. The hyperplane H orthogonal to h will be 
identified with M . For every to ^ —h, we denote by R^j the linear transformation that 
rotates /i to a; in the two-dimensional subspace through h and uj and that is the identity in 
the orthogonal subspace. Then one can map a point b G M*^^^ to a € -E by a = i?^6 + ruj. 

Let 6i, . . . , 6„ be vectors in M"'^-'^, and ai, . . . , a^ be the corresponding vectors in M'^ under 
this change of variables, i.e. 

ai = Ru^bi + ruj , i = l,...,d. (C.l) 
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The Jacobian of this change of variables (w, r, 61, . . . , 6^) 1— > (oi, . . . , 0^) equals 

(d-l)!Vol(A(6i,...,&rf)). 

This is a well known formula in the integral geometry due to Blaschke (see |13|). 

Lemma C.l (Change of variables). Let ai, . . . ,ad be independent Gaussian vectors in M 
with centers of norm, at most 1 and standard deviation a. Let E = aff(ai, . . . ,ad)- Then 
the vectors 61, . . . , 6^ obtained by change of variables (jC.ip satisfy the following. 

1. \\bi\\ < \\ai\\ for all i. 

2. Let us condition on a realization ofr and to (i.e. on E). Then the density 0/(61, ... , bd) 
is proportional to 

d 

Vol(A(6i,...,6rf)) J]z.,(6,), (C.2) 

1=1 

where Vi are the densities of Gaussian vectors in M.'^^^ with centers of norm at most 1 and 
standard deviation a. 

Proof. 1. First note that a, — ruj is orthogonal to cv. Indeed, (oj — ruj, to) = (aj, w) — r = 0. 
Therefore 

II ii2 II ii2iii ii2 iiT^i,ii2( 2 iiLii2i 2 

WO-iW = Wo-i — f^W +1^1^11 = ll^wOill +^ = IPill +r . 

This proves part 1. 

2. The density of (oi, . . . , o^) is ni=i l^i{0"'i)^ where /ij are Gaussian densities. Denote the 
center of the distribution of Oj by Oj. Let Pe denote the orthogonal projection in M'^ onto 
E. 

Since the realization of E is fixed, the induced distribution /ij(aj|aj G i?) is a distri- 
bution of a Gaussian vector in E with center Pgaj and standard deviation a. (See e.g. 
Proposition 2.4.3 in [6].) 

Since the change of variables (IC.ip is an isometry M — > E, part 2 is proved except the 
upper bound 1 the norms of the centers of 6j. These centers, denoted by bi, are the vectors 
in W^~^ that correspond to PgOj under change of variables (jC.ip : 

PEOi = Rj)i + ru. 

Since i? = {x G M : (x, w) = r}, we have PgO = ruj. Therefore 

ll^ill = \\Rujh\\ = \\PEai - ruj\\ = WPECLi - Pe^\\ < \\ai - 0|| = ||aj||. 

This proves part 2 of the lemma. ■ 
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